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Discharge of house hold waste water into surface water may present a variety of concerns such as high biochemical oxygen demand, 
significant nutrient inputs, high suspended solids, ecosystem disturbance and health hazard due to potential pathogens. House hold 
waste water consists of black water (urine and faeces) and grey water from the kitchen, bathroom and washing machine. Bacteria are 
the most important group of microorganisms found in this biological contact process. Water plays a very important role in 
supporting all forms of life. However if contaminated, it has great potential for transmitting a variety of diseases and illnesses. The 
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objective of the work is to study the molecular characterization by isolating DNA from the selected organism. The conserved region 
was amplified and the amplified product was sequenced based on 16s rRNA. The fasta format of the sequence was submitted in 
BLAST for the similarity searches. The functional region was predicted. 


Keywords: Pertussis; Bordetella parapertussis; In-silico analysis; Bioinformatics. 


Abbreviations: HMM - Hidden Markov model; BLAST - Basic Local Alignment Search Tool; ORF - Open reading frame. 


1. INTRODUCTION 


Bordetella parapertussis is a small Gram-negative bacterium of the genus Bordetella which is adapted to colonize the mammalian 


respiratory tract. Pertussis caused by B. parapertussis manifests with similar symptoms to B. pertussis. Bordetella pertussis is the only 
organism of major clinical significance in this genus, causing whooping cough in infants and young children (Ashok Gadgil, 1998). 
However, a closely related organism, B. parapertussis can also cause a milder form of bronchitis. It is an extremely small, strictly 
aerobic, Gram negative, non-motile cocobacillus (short rod). Compared to other Bortdetella species, B. pertussis does not grow on 
common laboratory media. B. pertussis can be distinguished from B. parapertussis in that B. pertussis is oxidase positive but urease 
negative, while B. parapertussis is oxidase negative and urease positive. B. bronchosepticus is positive for both enzymes. Symptoms 
are characteristic (Blackall, 1994). Laboratory diagnosis is made by obtaining a nasopharyngeal aspirate and primary culture in 
Bordet-Gengou medium or potato-glycerol-blood agar. Growth is inhibited by peptones, unsaturated fatty acids, sulphides, etc. 
found in ordinary media. The organism grows as small transparent hemolytic colonies on blood agar. It can be serologically 
distinguished from B. parapertussis and B. bronchosepticus (Boddinghaus et al., 1990). 


>/tmp/readseq.in.1338 [Unknown form], 783 bases 
acttcggtctggtgeceagtgeceaacggetgagtaattgtatcggaacg 
tecccagtagcgeeegataactacecgaaagcetgectaataccecatac 


gccctacggeggaaagcggeggaccttcggecctcgcactattggagcge 
ccgatatcggattagctagttgetggeetaacgectcaccaaggcgacga 
tccgtagctggtttgagaggacgaccagccacactgggactgagacacgg 


cccagactcctacgggagecagcagtgegeaattttggacaatgeegeca 
accctgatccagccatcccgcetgtgcgatgaageccttcggettgtaaa 


gcacttttggcaggaaagaaacgececcegctaatacctgecectaatga 
cggtacctgcagaataagcaccggctaactacetgccagcagccgcegta 
atacgtagggtgcaagcettaatcggaattactggecetaaagcetgcec 
aggecgettcggaaagaaagatgtgaaatcccagagcttaactttggaact 
gcatttttaactaccgggctagagtetgtcagagegagetggaattccgc 
gtgtagcagtgaaatgcgtagatatgcggaggaacaccgatgecgaaggc 
agcctcctgggataacactgacgctcatgcacgaaagcetggegagcaaa 
caggattagataccctggtagtccacgccctaaacgatgtcaactagctg 
ttggggccttcggeccttagtagcgcagctaac 


Figure 1 
FASTA format of the sequence 


The GeneMark gene-prediction algorithm was developed in several steps. The first step was performed by a group at the 
Institute of Molecular Genetics in Moscow in 1986. It was demonstrated that markov chain models were useful tools for DNA 
sequence analysis and particularly for gene prediction. The GeneMark.hmm algorithm (Lukashin and Borodovsky, 1998) was 
designed to improve gene-prediction quality in terms of finding exact gene boundaries. GeneMark program identified a gene 
mainly as the open reading frame where the gene is residing. However, the 5’ boundary of the gene (the translation initiation codon 
associated with the protein amino terminus) might not be precisely predicted. GeneMark indicates several possible start codons and 
scores them. The underlying idea of GeneMark.hmm was to embed the GeneMark models for coding and noncoding regions into 
the naturally derived hidden Markov model (HMM) framework, with gene boundaries modeled as transitions between hidden states. 
BLAST (Basic Local Alignment Search Tool) is a web-based program that is able to align your search sequence to thousands of 
different sequences in a database (that you choose) and shows you a list of the top matches. This program can search through a 
database of thousands of entries in under a minute. BLAST performs its alignment by matching up each position of your search 
sequence to each position of the sequences in the database. For each position, BLAST gives a positive score if the nucleotides 
match. BLAST can also insert gaps when performing the alignment. Each gap inserted has a negative effect on the alignment score, 
but if enough nucleotides align as a result of the gap, this negative effect is overcome and the gap is accepted in the alignment 
(Malik, 2008). These scores are then used to calculate the alignment score in “bits” which is converted to the statistical E-value. A 
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high bit score correlates with a low E-value. The lower the E value, the more similar the sequence found in the data base is to your 
query sequence (Wu et al., 2001). The objective of this work is to predict the similarity search of the microbe, submit the sequence in 
gene bank, analyze the phylogenetic relationship between the species, predict the functional region of the sequence and predict the 
promoter site of the sequence for transcription. 


2. MATERIALS AND METHODS 

2.1. Sample Collection 

Water samples are collected in a sterile containers from the residential (cooking and washing) waste water. It was then plated on the 
nutrient agar medium for the growth of microbes. 


2.2. Identification of microbes 
Characterizations of the selected isolates are confirmed by various physiological and biochemical test. 


2.3. Sequence analysis 
DNA was isolated from Bordetella sp. It was then amplified and sequenced based on 16s rRNA. 


2.4. In-silico analysis 

Sequence was submitted in BLAST for similarity searches. Evolutionary and Phylogenetic relationship was identified using MSA, 
Phylip and ATV software. Functional region of the sequence was identified using ORF finder. Coding region of the sequence was 
identified using Gene Mark. Promoter site region of the sequence was identified using promoter finder. 


<> BLAST My NCBI a 


2 


Home Recent Results Saved Strategies Help [Sian In} [Reaister 


+ NCBI BLAST/ blastn suite/ Formatting Results - MTSWM97X015 
| Edit and Resubmit Save Search Strategies © >Eormatting options + >Download 


| Brindha AGGGGGCCAGCCACGGGACTTCGGTCTGGTGGCGAGTGGCGAACGGGTGAGTAATTGTATCGGAACGTGCCCAGTAGCGG 


| Query ID |d|43657 Database Name nr 


Description Brindha Description All GenBank+EMBL+DDBJ+PDB sequences (but no EST, STS, 
AGGGGGCCAGCCACGGGACTTCGGTCTGGTGGCGAGTGGCGAACGGGTGAGTAATTGTATCGGAACGTGCCCAGTAGGE&S&, environmental samples or phase 0, 1 or 2 HTGS sequences) 
Molecule type nucleic acid Program BLASTN 2.2.25+ > Citation 


| Query Length 1405 


Other reports: >Search Summary [Taxonomy reports] [Distance tree of results 


Sequences producing significant alignments: 


Uncultured Bordetella sp. clone Fijan.21 16S ribosomal RNA gene, pz 1664 2170 97% 0.0 96% 


GO416682.1 


F9658206.1 Uncultured bacterium clone Winter_MachineA&B_HEPAoff_2e11 16S 1 1659 2164 97% 0.0 96% 
GQ417158.1 Uncultured Bordetella sp. clone FiSapr.44 16S ribosomal RNA gene, | 1659 2164 97% 0.0 96% 
GO417010.1 Uncultured Bordetella sp. clone Floct.34 16S ribosomal RNA gene, pi 1659 2164 97% 0.0 96% 
GQ417009.1 Uncultured Bordetella sp. clone Floct.33 16S ribosomal RNA gene, pi 1659 2164 97% 0.0 96% 
GQ417003.1 Uncultured Bordetella sp. clone Floct.27 16S ribosomal RNA gene, pi 1659 2164 97% 0.0 96% 
GO417001.1 Uncultured Bordetella sp. clone Floct.25 16S ribosomal RNA gene, pi 1659 2164 97% 0.0 96% 
GQ416999.1 Uncultured Bordetella sp. clone Floct.23 16S ribosomal RNA gene, pi 1659 2164 97% 0.0 96% 
GQ416998.1 Uncultured Bordetella sp. clone Floct.22 16S ribosomal RNA gene, pi 1659 2164 97% 0.0 96% 
GOQ416970.1 Uncultured Bordetella sp. clone Flaug.52 16S ribosomal RNA gene, p 1659 2164 97% 0.0 96% 
GQ416969.1 Uncultured Bordetella sp. clone Flaug.51 16S ribosomal RNA gene, p 1659 2164 97% 0.0 96% 
GOQ416956.1 Uncultured Bordetella sp. clone Fiaug.38 16S ribosomal RNA gene, p 1659 2164 97% 0.0 96% 
GO416952.1 Uncultured Bordetella sp. clone Flaug.34 16S ribosomal RNA gene, p 1659 2164 97% 0.0 96% 
GQ416914.1 Uncultured Bordetella sp. clone Fijun.48 16S ribosomal RNA gene, pz 1659 2164 97% 0.0 96% 
GQ416913.1 Uncultured Bordetella sp. clone Fijun.47 16S ribosomal RNA gene, pz 1659 2164 97% 0.0 96% 
GO416866.1 Uncultured Bordetella sp. clone Flapr.54 16S ribosomal RNA gene, pz 1659 2164 97% 0.0 96% 
GOQ416861.1 — Uncultured Bordetella sp. clone Flapr.49 16S ribosomal RNA gene, pz 1659 2164 97% 0.0 96% 
GO416858.1 Uncultured Bordetella sp. clone Flapr.46 16S ribosomal RNA gene, pz 1659 2164 97% 0.0 96% 
GO416686.1 Uncultured Bordetella sp. clone Fijan.25 16S ribosomal RNA gene, pz 1659 2164 97% 0.0 96% 
GQ416683.1 Uncultured Bordetella sp. clone Fijan.22 16S ribosomal RNA gene, pz 1659 2164 97% 0.0 96% 
GOQ416679.1 Uncultured Bordetella sp. clone Fljan.18 16S ribosomal RNA gene, pz 1659 2164 97% 0.0 96% 
GOQ416678.1 Uncultured Bordetella sp. clone Fijan.17 16S ribosomal RNA gene, pz 1659 2164 97% 0.0 96% 
GQ416677.1 Uncultured Bordetella sp. clone Fijan.16 16S ribosomal RNA gene, pz 1659 2164 97% 0.0 96% 
GOQ416676.1 Uncultured Bordetella sp. clone Fijan.15 16S ribosomal RNA gene, pz 1659 2164 97% 0.0 96% 
GQ416675.1 Uncultured Bordetella sp. clone Fijan.14 16S ribosomal RNA gene, pz 1659 2164 97% 0.0 96% 


Figure 2 
Similarity search of the sequence using BLAST 


2.5. Genbank submission 
The isolated species was submitted in Genbank for the identification of new species. 


disc@@very 


3. RESULTS AND DISCUSSION 


3.1. Sequencing of amplified gene based on 16s rRNA 

Sequence analysis of the 16s rRNA sequences is done with the help of several primers called universal primers. These primers target 

the conserved region of the 16s rRNA gene and amplify the target in parts. Finally the several amplified parts could be assembled ow 

together to have the entire sequence. The fasta format of the sequence was shown in Fig.1. v, 
© 
rail 
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3.2. In - silico analysis of the sequence 
The fasta format sequence was submitted in blast for similarity searches (Fig.2). 


CLUSTAL 2.1 multiple sequence alignment 


gi 257073439|9b169417158.1| _ pene ey ae fee gi |257073439 | gb|GQ417158.1 GAAGGCCTICGGGTIGIAAAGCACTITIGGCAGGAAAGAAACGGCGCCAG 429 
gi | 257073290 |gb|G9417009.1| e -ATTGAACGCTGGCGGGATGCITTACACATS 30 gi|257073290|gb1GQ417009.1 GAAGGCCTICGGGTIGIAAAGCACTITIGGCAGGAAAGAAACGGCGCCAG 429 
gi| 257073291 |ab/6Q417010.1| _ -ATTGRACGCTGCCGGGATGCTTTACACATG 30 gi|257073291|gb1GQ417010.1 GAAGGCCTICGGGTIGIAAAGCACTITIGGCAGGAAAGAAACGGCGCCAG 429 
911257072963 |gb160416682.1/ eae = -ATTGAACGCTGCCGGGATGCTTTACACATG 30 gi| 257072963 | gb|GQ416682.1 GAAGGCCTICGGGTIGIAAAGCACTITIGGCAGGAAAGAAACGGCGCCGG 429 
gi |223955061|gb| F658206.11 AGAGITIGATCCTGGCTCAGATIGAACGCIGGCGGGATGCTTITACACATG 50 gi 223955061 |gb|FU658206.1 GAAGGCCTICGGGTIGIAAAGCACTITIGGCAGGAAAGAAACGGCGCCAG 449 
Brindha --- -- -- ---AGGGGGCC-- =e Brindha GAAGGCCTICGGGTIGIAAAGCACTITIGGCAGGAAAGAAACGGCGCCGG 397 
gi|257073439|gb1GQ417158.11 CAAGTCGGACGGCAGCACGGACTICGGICTGGIGGCGAGIGGCGAACGGG 80 gi|257073439|gb|GQ417158.1 CTAATACCTGGCGCTAATGACGGTACCIGCAGAATAAGCACCGGCTAACT 479 
gi| 257073290 |gb/GQ417009.11 CAAGTCGGACGGCAGCACGGACTICGGICIGGIGGCGAGTGGCGAACGGS 80 gi|257073290|gb1GQ417009.1 CTAATACCIGGCGCTAATGACGGTACCTGCAGAATAAGCACCGGCTAACT 479 
911257073291 |gb|/6Q417010.1) CAAGTCGGACGGCAGCACGGACTICGGICIGGIGGCGAGIGGCGAACGGG 80 gil257073291 |gb|/GQ417010.1 CTAATACCTGGCGCTAATGACGGTACCTGCAGAATAAGCACCGGCTAACT 479 
gi1257072963 |gb1GQ416682.1| CAAGTCGGACEGCAGCACGGACTICGGICIGGIGGCGAGIGECGARCEGS 80 gi|257072963 |gb1GQ416682.1 CTAATACCTGGCGCTAATGACGGTACCIGCAGAATAAGCACCGGCIAACT 479 
a es erie i ag 1 game pan cncebandense anna Pt gi 223955061 |gb|FJ658206.1 CTAATACCTGGCGCTAATGACGGTACCTGCAGAATAAGCACCGGCTAACT 499 
ee ae GACT TOSGTCTesT ee Cen STCCCEN Cert Brindha CTAATACCIGGCGCTAATGACGGTACCIGCAGAATAAGCACCGGCTAACT 447 
JIG H HAASE adn nn niiddidoidiioienedididnane 
gil 257073439|gb1GQ417158.1| TGAGTAAT-GTATCGGAACGIGCCCAGTAGCGGGGGATAACTACGCGAAA 129 
gi|257073290|gb/6Q417009.1) TGAGTAAT-GIATCGGAACGIGCCCAGTAGCGGGGGATAACTACGCGAAA 129 gil 257073439 |gb|GQ417158.1 ACGTGCCAGCAGCCGCGGTAATACGTAGGGIGCAAGCGTTARTCGGAATT 529 
gil2570732911gb|GQ417010.11 TGAGTAAT-GIATCGGAACGIGCCCAGTAGCGGGGGATAACTACGCGAAA 129 gi|257073290|gb/GQ417009.1 ACGTGCCAGCAGCCGCGGTARTACGTAGGGIGCAAGCGTIAATCGGAATT 529 
gi| 257072963 |ab1GQ416682.1] TGAGTAAT-GIATCGGAACGIGCCCAGTAGCGGGGGATAACTACGCGAAA 129 gi 257073291 |gb|GQ417010.1 ACGIGCCAGCAGCCGCGGTAATACGTAGGGIGCAAGCGTTAATCGGAATT 529 
gi| 223955061 |gb|F658206.11 TGAGTAAT-GTATCAGAACGIGCCCAGTAGCGGGGGATAACTACGCGAAA 149 gi| 257072963 |gb|GQ416682.1 ACGTGCCAGCAGCCGCGGTAATACGTAGGGIGCAAGCGITAATCGGAATT 529 
Brindha TGAGTAATIGIATCGGAACGIGCCCAGTAGCGGGGGATAACTACGCGAAA 97 gi|223955061|gb| FU658206.1 ACGIGCCAGCAGCCGCGGTAATACGIAGGGIGCAAGCGTTAATCGGAATT 549 
Hintianee Hedin ubinudiinddnniddnndinnennidtinns iin Brindha ACGTGCCAGCAGCCGCGGTAATACGIAGGGIGCAAGCGITAATCGGAATT 497 
peprrrerrreverrnrrrerrrrrrrrrrrrrrr rr rrrrrrrrrrtrn 
gi 257073439 |ab/6Q417158.11 GCGTGGCTAATACCGCATACGCCCTACGGGGGAAAGCGGGGGACCTTCGS 179 
gi 1257073290 | gb1GQ417009.11 GCGTIGGCTAATACCGCATACGCCCTACGGGGGAAAGCGGGGGACCTICGG 179 gi 257073439 |gb|/GQ417158.1 ACTOCECCTAAACCCTSCCCACECECTICOSAAACAAAGATGTCRAATCC 579 
eee Sea eee eee pike gi 257073290 |gb|/GQ417009.1 ACTGGGCGTAAAGCGTGCGCAGGCGGTICGGAAAGRAAGATGIGAAATCC 579 
gi dl é 5 
gi| 223955061 |gb|Fu658206.1| GCGTGGCTAATACCGCATACGCCCTACGGGGGAAAGCGGGGGACCTICGS 199 dt cbt rode cpl ee eer ote 
eam pe ey paneer ee gi|257072963|gb|GQ416682.1 ACTGGGCGTAAAGCGTGCGCAGGCGGTICGGAAAGAAAGATGIGAAATCC 579 
aiivuaieudatousiauuakas bcauedetasensuatecatananace gi|223955061|gb| FU658206.1 ACTGGGCGTAAAGCGTGCGCAGGCGGTICGGAAAGARAGATGIGAAATCC 599 


gi|257073439|gb|GQ417158.1 
gil257073290|1gb1GQ417009.1 
gi|257073291|gb|GQ417010.1 
gi] 257072963 |gb|GQ416682.1 
gi|223955061|gb| FJ658206.1 
Brindha 


gi|257073439|gb|GQ417158.1 
gi|257073290|gb/GQ417009.1 
gi|257073291|gb|GQ417010.1 
gil 257072963 |gb/GQ416682.1 
gil 223955061 |gb| FJ658206.1 
Brindha 


ro 


gi|257073439|gb|/GQ417158.1 
gil257073290|gb1GQ417009.1 
gi|257073291|gb|GQ417010.1 
gil 257072963 |gb1GQ416682.1 
gil 223955061|gb| FU656206.1 
Brindha 


gil 257073439 |gb/GQ417158.1 
gil 257073290 |gb1GQ417009.1 
gi|257073291|gb|GQ417010.1 
gi|257072963|gb1GQ416682.1 
gil 223955061|gb|FJ656206.1 
Brindha 


Figure 3 


GCCICGCACTATIGGAGCGGCCGATATCGGATIAGCTAGTIGGIGGGGIA 
GCCICGCACTATIGGAGCGGCCGATATCGGATTAGCTAGTIGGIGGGGIA 
GCCICGCACTATIGGAGCGGCCGATATCGGATTAGCIAGIIGGIGGGGIA 
GCCICGCACTATIGGAGCGGCCGATATCGGATTAGCTAGTIGGIGGGGIA 
GCCTCGCACTATIGGAGCGGCCGATATCGGATTAGCIAGTIIGGIGGGGTA 
GCCICGCACTATIGGAGCGGCCGATATCGGATIAGCIAGTIGGIGGGGIA 


HEAEERERAAHEEEHEHEERAEEE REE E EERE ERE REE 


ACGGCTCACCAAGGCGACGATCCGTAGCIGGITIGAGAGGACGACCAGCC 
ACGGCTCACCAAGGCGACGATCCGTAGCIGGITIGAGAGGACGACCAGCC 
ACGGCTCACCAAGGCGACGATCCGIAGCIGGITIGAGAGGACGACCAGCC 
ACGGCTCACCAAGGCGACGATCCGTAGCIGGITIGAGAGGACGACCAGCC 
ACGGCTCACCAAGGCGACGATCCGTAGCIGGITIGAGAGGACGACCAGCC 
ACGGCTCACCAAGGCGACGATCCGTAGCIGGITIGAGAGGACGACCAGCC 


eetetttttttt titi ttt titi t tii t itt ttre iris 


ACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGIGGGS 
ACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGIGGGG 
ACACTGGGACIGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGIGGGG 
ACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGTGGGG 
ACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGIGGGG 
ACACTGGGACTGAGACACGGCCCAGACTCCTACGGGAGGCAGCAGIGGGG 


FERERAEARERA RARER REAR EERE RRR RR RRR RE 


AATITIGGACAATGGGGGCAACCCTGATCCAGCCATCCCGCGIGIGCGAT 
AATITIGGACAATGGGGGCAACCCIGATCCAGCCATCCCGCGIGIGCGAT 
AATTITIGGACAATGGGGGCAACCCIGATCCAGCCATCCCGCGIGIGCGAT 
AATTITIGGACAATGGGGGCAACCCIGATCCAGCCATCCCGCGIGIGCGAT 
AATITIGGACAATGGGGGCAACCCIGATCCAGCCATCCCGCGIGIGCGAT 
AATITIGGACAATGGGGGCAACCCIGATCCAGCCATCCCGCGIGIGCGAT 


ertttttttttitttttrtttttittittt iii i titi i tii t iris) 


229 
229 
229 
229 
249 
197 


279 
279 
279 
279 
299 
247 


329 
329 
329 
329 
349 
297 


379 
379 
379 
379 
399 
347 


Evolutionary relationships between the species using clustalW 


Brindha 


gi! 257073439|gb/GQ417158.1) 
gil 257073290 |gb1GQ417009.1) 
gil 257073291|gb1GQ417010.11 
gi|257072963|gb1GQ416682.1) 
gil 223955061|gb| F658206.1) 
Brindha 


gi! 257073439|gb/GQ417158.1) 
gil 257073290 |gb1GQ417009.1) 
gi|257073291|ab!/GQ417010.1) 
gil 257072963 |gb1GQ416682.11 
gi|223955061|gb| F656206.1) 
Brindha 


gi|257073439|gb/GQ417158.1) 
gi|257073290|gb1GQ417009.1) 
gil 2570732911gb1GQ417010.11) 
gi| 257072963 |gb1GQ416682.1/ 
gi | 223955061 |ab| Fu6é56206.1) 
Brindha 


gi|257073439|ab/GQ417158.1) 
gil 257073290 |gb1GQ417009.1) 
gi|257073291|gb/GQ417010.1) 
gi| 257072963 |qgb|/GQ416682.1) 
gil 2239550611 gb| FJ658206.11 
Brindha 


9112570734391 gb/GQ417158.1) 
911257073290 I gb/GQ417009.1) 
gi! 257073291) gb|/GQ417010.1) 
gi 1257072963 igb|GQ416682.1/ 
gil223955061|gb| FU658206.1) 
Brindha 


ACTGGGCGTAAAGCGTGCGCAGGCGGTICGGAAAGAAAGATGIGAAATCC 547 


pert eter eter treet eit itt te tite e ttt tree e te teers) 


CAGAGCTTAACTTIGGAACTGCATTITIAACTACCGGGCIAGAGIGIGIC 629 
CAGAGCTTAACTTIGGAACTGCATTITTIAACTACCGGGCIAGAGIGIGIC 629 
CAGAGCTTAACTTIGGAACTGCAITITIAACTACCGGGCIAGAGIGIGIC 629 
CAGAGCTTAACTTIGGAACIGCATTITIAACTACCGGGCIAGAGIGIGIC 629 
CAGAGCTTAACTTIGGAACTGCATTTTIAACTACCGGGCIAGAGIGIGIC 649 
CAGAGCTTAACTTIGGAACTGCATTITIAACTIACCGGGCIAGAGIGIGIC 597 


err rrrrerersrT Terr errr rererrrr rrr rr rrrr rrr tT’ 


AGAGGGAGGIGGAATICCGCGIGIAGCAGIGAAATGCGTAGATATIGCGGA 679 
AGAGGGAGGIGGAATICCGCGIGIAGCAGIGAAATGCGIAGATATGCGGA 679 
AGAGGGAGGTGGAATICCGCGTGIAGCAGIGAAATGCGIAGATAIGCGGA 679 
AGAGGGAGGIGGAATICCGCGIGIAGCAGIGAAATGCGTIAGATATGCGGA 679 
AGAGGGAGGIGGAATICCGCGIGIAGCAGIGAAATGCGTAGATAIGCGGA 699 
AGAGGGAGGTGGAATICCGCGTGIAGCAGIGAAATGCGTAGATATIGCGGA 647 


GGARCACCGATGGCGAAGGCAGCCICCIGGGATAACACIGACGCICAIGC 729 
GGAACACCGATGGCGAAGGCAGCCTCCIGGGATAACACIGACGCICATGC 729 
GGAACACCGATGGCGAAGGCAGCCICCIGGGATAACACIGACGCICATGC 729 
GGAACACCGATGGCGAAGGCAGCCICCIGGGATAACACIGACGCICAIGC 729 
GGAACACCGATGGCGAAGGCAGCCICCIGGGATAACACTGACGCTICATGC 749 
GGAACACCGATGGCGAAGGCAGCCICCIGGGATAACACIGACGCICATGC 697 


ACGAAAGCGTGGGGAGCAAACAGGATIAGATACCCIGGIAGICCACGCCC 779 
ACGAAAGCGIGGGGAGCAAACAGGATTAGATACCCIGGIAGICCACGCCC 779 
ACGAAAGCGIGGGGAGCAAACAGGATTAGATACCCIGGIAGICCACGCCC 779 
ACGAAAGCGTGGGGAGCAAACAGGATTAGATACCCTGGTAGICCACGCCC 779 
ACGAAAGCGIGGGGAGCAAACAGGATTAGATACCCIGGTAGICCACGCCC 799 
ACGAAAGCGTGGGGAGCAAACAGGATTAGATACCCIGGTAGICCACGCCC 747 


TAAACGATGICAACTAGCIGTIGGGGCCTICGGGCCTIAGTAGCGCAGCT 829 
TAAACGATGICAACTAGCIGITGGGGCCTICGGGCCTIAGIAGCGCAGCT 829 
TAAACGATGICAACTAGCTGTIGGGGCCTICGGGCCTIAGIAGCECAGCT 829 
TAAACGATGICAACTAGCIGITGGGGCCTICGGGCCTIAGIAGCGCAGCT 629 
TAAACGATGICAACTAGCIGTIGGGGCCTICGGGCCTTAGIAGCGCAGCT 849 
TAAACGATGTCAACT: CGGGCCTIAGIAGCGCAGCT 797 


3.3. Multiple sequence analysis 

Evolutionary relationship of the organism was identified using the tools clustalw and the Phylogenetic relationship between the 
organisms was identified using ATV software. The distance between the selected species was identified by dendrogram graphical 
representation and was shown in Fig.3. The phylogenetic relationship between the species was shown in Fig.4. 


3.3.1. Genbank Submission 
The identified sequence was directly submitted to Genbank (Fig.5). The Genbank provided accession number for the submitted 
nucleotide sequence. The accession number is BankIt1435630 Bordetella JF420884. 


3.3.2. Gene Prediction 

Functional region of the selected organism was identified by submitting the sequence to orf finder, Genemark and Promoter finder. 
The functional regions were shown in Fig. 6, 7 and 8. The Gene Mark graphical output depicts the coding potential in the six possible 
reading frames. An unbroken horizontal line at the 0.5 level indicates an open reading frame (ORF). Large vertical lines above the 0.5 
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level indicate “ATG” codons. Vertical lines under the 0.5 level represent one of the three stop codons, “TAA,” “TGA,” or “TAG.” The 
thick gray horizontal line indicates a region of interest. 
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Figure 4 
Phylogenetic relationships between the species using ATV software 


4. CONCLUSION 

There are many practical applications for identifying bacteria from waste water. One of the main responsibilities is to determine the 
identity of pathogenic bacteria. It is very important to determine the specific type of bacterium causing disease so that the physician 
able to correctly treat the patients. Bordetella sps. colonize the mammalian respiratory tract and cause whooping cough in infants 
and young children. It also causes hypoglycemia, increased histamine and endotoxin sensitivity. The toxin inhibits many leukocyte 
functions, including chemotaxis, phagocytosis and respiratory burst and impairs NK cell killing. Treatment is by routine acellular 
vaccine immunization. Present study investigates only to identify the pathogenic organism from waste water based on its molecular 
characters. 


SUMMARY RESEARCH 

1.Water plays a very important role in supporting all forms of life. However if it is contaminated, it has great a potential for 
transmitting a variety of diseases and illnesses to human beings via pathogens like bacteria. 

2.In the proposed work, DNA of Bordetella parapertussis was isolated and its molecular characterization was analyzed by 
bioinformatics tools. 
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FUTURE ISSUES 
1.Predict the 3D structure of the protein/ toxin which causes the disease whooping cough. 
2.Design a drug for the disease whooping cough using medicinal plants. 
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Functional region of the sequence using ORF Finder 
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Figure 7 
Coding region of the sequence using Gene Mark 


Promoter predictions for 1 prokaryotic sequence with score cutoff 0.80 (transcription start 
shown inlarger font): 


Promoter predictions for /tmp/readseq.in.d 338; 


Start End Score Promoter Sequence 


353 398 0.91 acttttggcaggaaagaaacggcgccggctaatacctggc(] ctaatgacg 


541 586 0.94 ctttggaactgcatttttaactaccgggctagagtotatcCgagggaggt 


Figure 8 
Promoter site of the sequence using Promoter Finder 
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